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ABSTRACT 


Nonparametric  techniques  used  to  solve  the  change  point 
problem  are  applied  to  the  problem  of  texture  classification. 
The  texture  classification  problem  is  not  formulated  as  a  hy¬ 
pothesis  testing  problem,  but  instead  our  interest  lies  in 
the  values  of  K^,  K+,  and  K",  the  change  point  statistics. 
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1. 


Introduction 


Although  the  problem  of  texture  classification  has  been 
thoroughly  studied,  it  is  still  desirable  to  investigate 
whether  or  not  we  can  model  the  random  behavior  of  the  imaging 
equipment  and  still  achieve  a  reliable  classification  scheme. 

To  this  end,  we  introduce  a  statistical  test  previously  used 
to  determine  whether  or  not  a  sequence  of  random  variables 
has  a  change  point,  i 

Definition:  A  change  point  is  defined  to  be  an  index  t  in 

a  sequence  x1,x2,...,xT  of  random  variables  such  that  x^,x2,..., 
Xj.  have  a  common  distribution  F^tx)  and  x  +^,...,x_  have  a 
common  distribution  F2  (x)  ,  where  F.(x)^F2(x). 

Note  that  there  is  no  change  point  if  x=T. 

Determining  whether  or  not  a  change  point  exists  in  a 
sequence  of  random  variables  is  related  to  two  texture  classi¬ 
fication  methods  described  in  Weszka  et  al.  [1] .  From  the 
description  of  the  change  point  statistics  in  Section  2,  this 
association  will  become  clearer. 
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2.  Change  point  statistics 

Many  authors  nave  presented  approaches  to  solving  the 
change  point  problem.  These  include  tests  for  a  change  in 
mean  level  (Sen  and  Srivastava  [2,6]),  likelihood  ratio 
tests  (Hinkley  [3]),  Bayesian  approaches  to  inference  about 
t  (Smith  [4]),  and  distribution-free  approaches  as  in 
McGilchrist  and  Woodyer  [5]  or  Sen  and  Srivastava  [2] . 

We  desire  to  use  a  method  which  makes  no  assumptions 
about  the  initial  distribution.  Thus  we  consider  a  version 
of  the  Mann-Whitney  U  statistic.  This  statistic  can  be  used 
for  testing  the  hypotheses  of  no  change  point  versus  change 
point  at  r. 

Let  us  now  examine  the  Mann-Whitney  statistic  for  testing 
if  two  samples  (x1»x2, . . .xfc  and  xfc+1,...,xT)  come  from  the 
same  population.  The  statistic  U.  „  is  defined  as 

"C  f  1 


u 


t,T 


t  T 
=  I  E 
i=l  j=i+l  ij 


where 


D± j  =  sgn(xi-xj) 

To  use  the  above  statistic  to  solve  the  change  point  prob¬ 
lem,  and  for  our  purposes,  we  let  t  vary  such  that  lst<T.  Then 
we  introduce  the  following  statistics: 

max 


Kt  =  l<t<T  |Ufc  T| 


,  max 

Kt  -  l<t<T  UtfT 


(3) 

(4) 


K 


T 


t,T 


-min 
lst<T  U 


(5) 


which  are  the  definitions  in  Pettitt  [7] .  We  refer  to 
K^,  K+,  and  as  the  change  point  statistics.  It  is 
easy  to  see  that 

Kt  =  max(K+,  K") 

It  is  precisely  this  fact  which  we  wish  to  investigate. 

and  will  be  computed  along  the  columns,  rows,  and 
diagonals  of  an  image,  and  we  wish  to  see  whether  is  K* 
or  K^.  This  gives  an  indication  as  to  the  nature  of  the 
changes  in  sign  of  gray  levels  between  pixels  in  different 
portions  of  the  image.  Thus,  if  KT=K*,  the  majority  of  the 
changes  are  positive;  if  K^k”,  the  majority  of  the  changes 
in  sign  are  negative. 


.  The  relationship  between  change  point  statistics  and  other 


features 

In  this  section  we  wish  to  show  the  relationship  between 
the  change  point  statistics  and  other  texture  classification 
features  alluded  to  in  Section  1. 

Weszka  et  al.  [1]  also  form  differences  of  gray  levels, 
although  these  differences  are  absolute  differences  for  a 
given  displacement  <5 .  The  probability  density  of  these  differ¬ 
ences  is  estimated  by  the  number  of  occurrences,  and  various 
measures  are  computed  from  these  probability  estimates,  P^ . 

The  change  point  statistics  use  the  sign  of  the  difference  and 
thus  include  additional  information  relating  to  the  signs  of 
the  changes  in  gray  level.  Also,  the  displacement  is  varied 
up  to  a  given  limit  (l<t<T)  and  so  information  is  included  over 
various  displacement  values,  6. 

The  survey  by  Haralick  [8]  as  well  as  Weszka  [1]  discuss 
the  use  of  run  length  statistics.  This  is  simply  counting 
the  number  of  pixels  with  the  same  gray  level.  This,  too,  is 
included  in  the  change  point  statistics,  since  the  run  length, 
t,  is  varied  between  1  and  an  upper  bound  of  T.  Pixels  of  the 
same  gray  level  in  a  run  add  0  to  the  sum  U  _  by  definition 
of  the  sign  function. 

The  gray-tone  spatial  dependence  matrices  of  Haralick  [8] 
and  Haralick  et  al.  [9]  are  used  to  model  the  probabilistic 
behavior  of  texture.  What  is  not  mentioned  is  the  fact  that 


there  are  underlying  assumptions  in  using  Pearson  product 

t 

moment  correlation,  and  these  statistics  cannot  be  arbi¬ 
trarily  applied.  Siegel  [10]  indicates  that  the  Pearson 
product  moment  correlation  requires  scores  in  at  least  an 
equal  interval  scale,  and  that  the  scores  be  from  a  bivariate 
normal  population.  If  we  are  dealing  with  gray  levels,  the 
first  assumption  is  satisfied  (that  of  equal  scale) ,  but 
the  second  assumption  of  normality  may  not  be  met. 

Thus  we  have  chosen  to  use  distribution-free  statistics 
where  no  assumptions  other  than  continuity  of  F(x)  are  made. 

This  is  even  weaker  than  the  normality  assumption,  since 
the  continuity  assumption  is  underlying  the  assumption  of 
normality. 

Here  we  have  shown  similarities  and  differences  between 
change  point  statistics  and  previous  methods  for  texture  classi¬ 
fication.  The  similarities  indicate  that  we  can  expect  simi¬ 
lar  results  with  change  point  statistics,  yet  the  statistical 
differences  indicate  that  a  larger  sample  size  is  needed  for 
the  same  level  of  significance.  The  reader  is  referred  to 
Randles  &  Wolfe  [11]  for  a  discussion  of  asymptotic  relative 
efficiency  of  distribution-free  statistics. 


4*  The  computation  of  change  point  statistics 

Recall  that  in  order  to  compute  kt#  K*,  and  K~,  it  is 
necessary  to  compute 


0t-T  =  til  i-L  a) 

for  values  of  t  such  that  l<t<T,  where  T  is  fixed,  in  the 
results  presented  here,  several  values  of  T  were  used  only 
to  see  if  there  existed  a  value  T  beyond  which  the  results 

did  not  vary,  in  this  work  both  t  and  T  represented  positions 
within  an  image. 

The  above  formula  can  be  computationally  expensive.  Empiri¬ 
cal  results  show  that  for  values  of  T>10,  computation  can  ex¬ 
ceed  10  minutes.  As  Haralick  et  al.  [9]  indicate,  it  is  de¬ 
sirable  to  use  a  method  that  is  computationally  feasible.  We 
present  an  alternative  to  equation  (1)  that  is  briefly  men¬ 
tioned  in  Pettitt  [7] ,  but  is  not  fully  discussed. 

In  equation  (1) ,  note  that 

t-1  T  T 

Ut,T  =  •E,  •  E  .  Dij  +  1  Dti  (7) 

Now  1=1  3=t+l  3  j=t+l 

t-1  T  t-1  T  t-1 

Z  I  D.  .  =  Z  ZD..—  ZD.  (8^ 

i-1  j-t+1  13  i-1  j-t  13  i-i  (8) 

Substituting  (8)  into  (7): 
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Notice  that  U, 


t-1  T 

=  £  £  D. •  by  definition, 

i=l  j=t 
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£  D  .  by  symmetry  of  the  sign  function 
i=l  tx 
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Also  since  Dtt~0  by  definition,  we  arrive  at  our  final  recur¬ 
sion  formula 


Ut,T  =  Ut-1,T  +  Jl^tj 


(12) 


Hence  our  computation  can  be  speeded  up  by  storing  the  previous 

values  of  T  for  the  computation  of  Ufc  T-  The  empirical 

results  indicate  that  using  this  formula  allowed  us  to  compute 

U  for  up  to  T=20  in  under  two  minutes, 
t  f  T 

Be  careful  to  note  that  the  above  equations  still  only  deal 
with  one  dimensional  random  variables.  In  order  to  apply  these 
statistics  to  images,  it  is  necessary  to  make  a  two-dimensional 
adjustment.  We  have  done  this  in  the  following  manner: 

When  computing  U.  along  the  rows  (direction  "I")  let 

w  **• 


Dij  =  kz1s^n(xik  '  xjk}  (13) 

where  n  is  the  number  of  rows.  When  computing  Ufc  T  along  the 
columns  (direction  "J") ,  we  use  a  similar  equation 


(14) 


P 


D 


J 

ij 


n 

^sgn^i  -  xkj) 


A  slightly  different  equation  is  used  to  compute  along 
the  image  diagonals.  We  desire  to  capture  information  from 
the  diagonals  throughout  the  image,  not  the  longest  diagonals 
only.  This  is  accomplished  by  varying  the  slope  of  the  dia¬ 
gonal.  We  arrive  at 


-  x(n+1_k)j> 


(15) 


where  K  denotes  the  diagonal  axis.  One  can  easily  see  that 
the  differencing  takes  place  along  a  line  whose  slope  is  varied, 
and  is  the  sum  of  such  differences. 

By  computing  D.1. ,  D.J.  ,  and  D.K. ,  we  will  have  U*  _,  uf  and 

ijlj  J.J  X. ,  1  u ,  1 

K  + 

T.  With  these  U  statistics  in  hand,  we  can  compute  , 

Kt  and  eventually  =  max  ( ,  K^, )  for  each  of  the  directions 

X  iJ  K 

I,J,  and  K.  The  triple  (K^K.pjK.p)  is  used  to  characterize  a 
given  image. 

In  practice,  instead  of  using  the  values  of  the  K^,  we  use 

instead  the  sign  (+/-)  of  the  change  point  statistic  (K*  or  K~) 

T 

that  K  equals.  Thus  for  each  sample  we  have  a  triple  of  direc¬ 
tions  indicated  by  signs.  Intuitively,  these  signs  indicate 
whether  the  gray  levels  increase  or  decrease  in  value  in  the 
specified  directions.  If  a  sign  is  positive  (negative) ,  this 
means  the  difference  in  gray  level  is  positive  (negative) ,  and 
the  gray  level  has  increased  (decreased)  in  value.  The  following 
sections  describe  some  results  of  using  change  point  statistics 
in  practice. 


5.  Experimental  results 

This  experiment  consisted  of  two  parts:  First,  we  want 
to  see  if  change  point  statistics  can  be  used  in  texture 
classification,  and  secondly  we  are  interested  in  determining 
if  it  is  necessary  to  let  T  approach  the  value  of  n  for  an 
n*n  image.  This  is  motivated  by  the  fact  that  the  computation 
of  U  is  so  costly,  and  if  T=n,  we  have  a  combinatorial  ex- 

t  /  T 
plosion. 

The  results  are  presented  in  the  following  format:  For 
each  of  the  directions  I,J,  and  K  we  compute  K*  and  K^,  by 
equations  (4)  and  (5)  .  With  and  K^,  we  compute  KT  by  equa¬ 
tion  (6) .  For  each  sample  and  axis,  we  record  either  +  or  - 
depending  on  whether  KT=K+  or  K~,  respectively.  Thus  for  each 
sample  we  have  the  triple  of  change  point  statistics  (values 
of  Kt)  represented  by  +  or  -.  These  are  what  we  use  to  differ¬ 
entiate  the  texture  samples. 

For  the  second  part  of  the  experiment,  we  let  T=5,10,  and  20. 
The  purpose  of  this  is  twofold:  We  desire  to  see  if  the  com¬ 
putation  can  be  completed  and  then  we  use  these  results  as  the 
training  sets  for  classification.  The  classification  results 
for  the  different  values  of  T  are  compared  to  see  if  the  classi¬ 
fications  are  the  same. 

The  results  are  tabulated  in  Figures  1,2,  and  3  for  T=5,10, 
and  20,  respectively.  In  this  work,  we  have  used  texture  samples 
of  Mississippian  Limestone  and  Shale (ML),  Pennsylvanian  Sandstone 
and  Shale (PS),  and  Lower  Pennsylvanian  Shale (LP).  Ten  samples 


of  each  texture  were  used. 


A  texture  classification  decision  rule  similar  to  that  of 
Haralick  et  al.  [9]  cam  be  employed  here.  For  each  of  the 
three  groups  (ML,PS,LP)  we  classify  samples  of  that  group 
by  the  most  likely  triple  that  occurs.  For  example,  if  a 

change  point  triple  is  +++  or  -  ("flat")  it  is  most  likely 

to  be  PS(see  Figure  2) . 

Again,  looking  at  Figure  2,  the  majority  of  the  ML  samples 
have  a  -  for  the  I  axis  whereas  the  LP  samples  have  a  +  for 
the  I  axis.  Therefore,  if  a  sample  does  not  exhibit  flat 
structure  as  defined  above,  we  classify  it  as  ML  or  LP  depending 
on  the  I  axis  statistic. 

To  test  this  decision  rule,  10  samples  were  chosen  randomly 
from  among  the  ML, PS,  and  LP  windows.  Their  change  point  sta¬ 
tistics  were  computed,  and  they  were  classified  accordingly. 
Their  actual  types  are  also  given  in  Figure  4. 

The  second  part  of  this  experiment  was  to  determine  the 
accuracy  of  using  a  small  value  of  T.  Each  of  the  training 
sets  were  used  to  classify  the  samples  and  the  differences  were 
examined.  It  is  important  to  note  that  a  preallotea  amount 
of  time  was  given  to  compute  the  training  set  statistics.  This 
was  done  to  see  if  the  statistics  could  be  computed  in  a  reason¬ 
able  amount  of  time. 


6.  Conclusion 


In  this  paper  we  have  presented  a  new  method  for  texture 
classification.  It  has  been  shown  that  the  use  of  change 
point  statistics  is  very  similar  to  other  information  pre¬ 
viously  used  in  texture  classification,  yet  change  point  sta¬ 
tistics  maintain  the  distinct  advantage  of  using  distribution- 
free  statistics.  That  is  to  say,  our  model  of  image  and  noise 
does  not  assume  any  form  of  the  noise  distribution.  The  proba¬ 
bility  distribution  of  the  change  point  statistics  themselves 
is  independent  of  the  probability  distribution  of  the  under¬ 
lying  random  variable. 

Figures  1,2,  and  3  show  the  change  point  statistics  for 
the  training  sets.  Since  for  T=20,  the  computation  did  not 
finish,  we  conclude  that  this  computation  is  prohibitive,  even 
when  we  use  the  recursion  equation  and  store  intermediate  re¬ 
sults.  We  still  wish  to  investigate  whether  or  not  the  classi¬ 
fier  is  accurate  in  going  from  T=5  to  T=10. 

We  see  from  Figure  4  that  the  classifier  using  change  point 
statistics  is  about  90%  accurate  for  the  samples  given.  This 
is  not  entirely  conclusive  since  it  is  still  desirable  to  classi 
fy  with  the  T=20  training  set.  Clearly,  one  future  direction 
is  to  derive  an  algorithm  even  faster  than  equation  (12)  and  a 
better  search  of  the  U  „  space  for  K*  and  K~.  Also,  one  could 

u  ^  1  1  X 

extend  this  work  by  training  on  a  larger  number  of  texture  types 
We  have  shown  that  change  point  statistics  are  valuable  as  an 
aid  in  texture  classification. 
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Figure  1.  Change  point  statistics  for  T=5. 
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Figure  2.  Change  point  statistics  for  T=10 
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Figure  3.  Change  point  statistics  for  T=20 
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